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ABSTRACT 

We present a cosmography analysis of the Local Universe based on the recently released 
Two-Micron All-Sky Redshift Survey (2MRS). Our method is based on a Bayesian Net- 
works Machine Learning algorithm (the KlGEN-code) which self-consistently samples 
the initial density fluctuations compatible with the observed galaxy distribution and 
a structure formation model given by second order Lagrangian perturbation theory 
(2LPT). From the initial conditions we obtain an ensemble of reconstructed density 
and peculiar velocity fields which characterize the local cosmic structure with high 
accuracy unveiling nonlinear structures like filaments and voids in detail. Coherent 
redshift space distortions are consistently corrected within 2LPT. From the ensemble 
of cross-correlations between the reconstructions and the galaxy field and the variance 
of the recovered density fields we find that our method is extremely accurate up to k ~ 
1 /iMpc _1 and still yields reliable results down to scales of about 3-4 h~ x Mpc. The 
motion of the local group we obtain within ~ 80 h~ l Mpc (^lg = 522 ± 86 krns" 1 , 
^lg = 291° ± 16°,6lg = 34° ± 8°) is in good agreement with measurements derived 
from the CMB and from direct observations of peculiar motions and is consistent with 
the predictions of ACDM. 
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1 INTRODUCTION 

The Local Universe (LU) harbours the link between the 
early Universe and our present day cosmic environment. A 
profound analysis of its cosmic structure is thus essential 
to gain insight into the processes which lead to structure 
formation in our surroundings and ultimately to our own 
Galaxy. This is one of the main go als performing constrained 
simulations of the LU (see e. g. Bist olas fc Hoffman 1998; 
iKravtsov et~ai1l2002l : iKivpin et al. 2003). To carry out this 
kind of studies one needs first to recover the initial fluctua- 
tions which gave rise to the galaxy distribution we observe 
in our neighbourhood. However, such a task implies vari- 
ous complications since in general a galaxy redshift survey 
provides a discrete sample of biased matter tracers which 
are degraded by observational effects like a radial selection 
function, due to a magnitude limit cut, and redshift-space 
distortions caused by peculiar motions with resp ect to the 
Hubble flow (see discussion in lPlaten et alJl2Qll[ and refer- 
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ences therein) . For very nearby structures one may use mea- 
surements of radial velocities to p artially avoid these prob- 
lems (see e. g. lZaroubi et al.lll999h . However, one still needs 
to trace the observations back in time unfolding gravity 
which couples matter on different scales in a nonlinear and 
nonlocal way. While linear approaches are useful to recon- 
struct the baryon acoustic oscillations signal on very large 
scales ([Eisenstein et al]|2007T ) , they fail on small scales when 
the nonlinear regime becomes relevant. One may solve the 
boundary problem of finding the initial Lagrangian positions 
of galaxies by mini mizing an action as suggested in differ- 
ent works (see e. g. [Pe ebles 1989; Nusser fc Branching 2000; 



Branchini et al. 112002: Brenier et al. 2003|). The determina- 
tion of the initial conditions (and hence of the displacement 
field) automat ically yields estimates on the peculiar velocity 
field (see also iMohavaee fe Tullvl liooHl : ILavaux et ai1l2008l . 
in addition to the previously cited works) . The mismatch in 
the measurement of the Local Group ( LG) velocity from the 
cosmic microwave background (CMB) (jHinshaw et al.ll2 009) 
and f rom the matter distribution in the LU fsee lBilicki et al.l 
l201lL and references therein) together with the recent claims 
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Figure 1. On the left: Normalised cross-power spectra XP(k) = 

( 1 5 rec (k)58 al (k) | ) / ( V^ rec (k) v/Ps al (fc) ) between the galaxy over- 
density and the reconstructed density field with 1 and 2 sigma 
contours (light and dark shaded regions, respectively). On the 
right: power spectrum of the galaxy field P gal (/c) (dashed curve), 
linear ACDM power spectrum (red curve), mean of the 100 re- 
constructed linear power-spectra of the initial field (black curve) 
with 1 and 2 sigma contours (light and dark shaded regions, re- 
spectively). 



on large-scale flows which could challenge the standard cos- 
mological model ( or give some hint s on the initial pertur- 
bations of the Univ erse, see e. g. Kas hlinskv et al.l |2008; 
Watkins et al ] l2QQ9h stresses the need for more accurate 
studies of the local dynamics. It should be noted that the 
number of solutions compatible with the observations is de- 
generate due to shell crossi ngs and redshift-space distortions 
(see e. g. lYahil et al.|[l99lh . Moreover, the approaches men- 
tioned above do not provide yet the initial fluctuation field. 
At this point another assumption needs to be made on the 
statistics of the initial fluctuations. From these arguments 
it should be clear that the estimation of the initial con- 
ditions corresponding to a galaxy distribution in redshift- 
space has a stochastic nature which should be treated in a 
statistical way to accurately model the propagation of uncer- 
tainties. For this reason we suggest to extend the Bayesian 
works based on line ar Gaussian fields ( see the pioneering 
work s during the 90s: [H offman & Ribak 1991; Zaroubi et al 



1995; Fisher et al 



1995; 



1996; 1 Zaroubi et al.l 1999; 



van d e Wevga ert Bertschinger 
Schmoldt et al.lll999h and apply 



a Bayesian Networks Machine Learning approach including 
a nonlinear and nonlocal model for structure formation pro- 
viding an ensemble of reconstru cted initial an d final density 
and peculiar velocity fields (see iKit aural T2012) . This ensem- 
ble of solutions enables us to estimate in a realistic way 
the uncertainties in the reconstruction and cross-check the 
accuracy of the method with the observations, improving 
previous work concerning single "optimal" solutions limited 
to linear Eulerian or Lagrangian perturbation theory. 



2 DATA, METHOD AND RESULTS 

The analysis presented here is based on version 2.3 of the re- 
cently rele ased 2MASS redshi ft survey (2MRS), K s = 11.75 
catalogue ([Huchra et al.ll2012h . The 2MRS survey is unique 
in its sky coverage (91%) and uniform completeness (97.6%) 
only limited near the Galactic plane (the Zone of Avoidance 
(ZoA), where |6| < 5° and |6| < 8° near the Galactic cen- 
tre). The ZoA could be sampled with a Poissonian likeli- 
hood describing the counts-in-cells of the galaxy distribu- 



tion which would be lim ited to a coarse grid resolution (see 
e. g. Kitaura et al . 2010). To incorporate a mask treatment 
within a particle based reconstruction method like ours, we 
would need to sample mock galaxies (or haloes) according 
to our structur e formation model following schemes like the 
one proposed in lScoccimarro &~S heth ( 2002), which is out of 
scope in this work. For the time being, we fill the ZoA with 
random galaxies generated from the co rresponding long i- 
tude/distance bins in the adjacent strips (jYahil et al .11 1991 



The method is robust for the width of the 2MRS mas k and 
has b een thoroughly tested (for details see lErdogdu et al.l 
2006, and references therein). According to recent studies, 
the LG veloci ty should be indep endent of the treatment of 
the mask (see Bilicki et al. 2011). Another essential ingredi- 
ent is the radial selection function / sel arising from the mag- 
nitude limit cut of the galaxy redshift survey. Here we derive 
/ sel from the 6 degree fie ld galaxy survey (6dFGS) luminos- 
ity function (| Jones et al.ll2006h , imposing a magnitude limit 
cut which corresponds to the one of 2MRS. However, we 
note that t he choice of the particu lar derivation of / sel is not 
crucial (see lBranchini et al.ll2012i ). The behaviour of / sel can 
be further checked in the power-spectra of the reconstructed 
initial fluctuations P Tec (k) (see below). Finally, we compress 
the fingers- of- god after identifying them with a friends-of- 
friends algorithm taking into account the ellipsoidal distri- 
bution along the l ine-of-sight of group s of galaxies due to 
virial motions fsee lTegmark et al ll2QQ4h . The resulting cat- 
alog provides the Cartesian three-dimensional positions of 
the 2MRS galaxies in Supergalactic coordinates including 
coherent redshift-space distortions. 

This catalog is used as an input for the KlGEN- 
code which relies on 2LPT to descri be structure formation 
(see recent works on this subject: [K itaura & Angulo 2011; 
iKitaura et aDl201ll : [jasche fe Wand elt 2012; Kitaura 2012, 



and references therein). We assume Gaussian initial density 
fields with a variance determined by the cosmological param- 
eters from the concordance ACDM-cosmology as provided 
by the Seven- Year Wilkinson Microwave A nisotropy Probe 
(WM AP) (see table wmap7+bao+h0 in iKomatsu et al.l 
1201 lh . In this study we consider the data within a comoving 
box of 160 h~ x Mpc side (comprising about 30,000 galaxies). 
Coherent redshift-space distortions a re consi stently modeled 
with 2LPT (beyond the Kaiser limit; Kaiserl l 19871 ) by adding 
a radial term to the Eulerian real-space position x = q-\- 
where q is the Lagrangian position of the matter tracers 
at the initial conditions and \l/ is the displacement field 
according to 2LPT. The redshift-space position s of each 
matter tracer is thus given by the following equation in our 
model s — q + \l/ + v r , with v r = (v • r)r/(Ha), where v 
is the full three dimensional 2LPT velocity field, r is the 
unit sight line vector, H the Hubble constant and a the 
scale factor. The KiGEN-code samples the initial fluctua- 
tions according to the set of Lagrangian test particles {q} 
which under 2LPT yields a distribution in Eulerian redshift- 
space {s} compatible with the observed galaxies {sg}. We 
use in each constrained 2LPT simulation 256 3 test particles 
and compute the displacement field on a grid of 128 3 cells 
with a resolution of l c = 1.25 h~ x Mpc. Particles which are 
closer than l c to a galaxy are considered to be "friends" of 
that galaxy and their Lagrangian positions are used as con- 
straints to determine the initial Gaussia n fluctuation s (for 
more details on the method we refer to Kitaura 2012). We 
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Figure 2. Supergalactic XY-plane (SGZ=0) with ~20 Mpc thickness of a 128 3 grid with 160 h~ x Mpc side (resolution of 1.25 
Mpc): left panel: logarithm of the reconstructed density field ln(2 + 5 rec ) (one sample belonging to the highly correlated subsample of 21 
reconstructions which have a cross-correlation with the galaxy overdensity better than 1 sigma at scales ^ 3.5 Mpc) with overplotted 
observed galaxies in red and augmented ones in blue; middle panel: logarithm of the mean density field of all 100 samples ln(2+ (5 rec )), 
right panel: v x — v y velocity field with the underlying galaxy overdensity field after 3.5 h~ x Mpc Gaussian smoothing. The length of the 
arrows is proportional to the average speed at that location. 
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Figure 3. Contour plot: left panel and middle panel: corresponding to the left and middle panels in Fig. [2] respectively; right panel: 
logarithm of the signal-to-noise ratio of the mean density field (S/N= (5 rec ) /a(S rec )) in each cell from all samples in the same slice as 
the other panels. 



account for selection function effects and shot noise in our 
reconstruction by assigning in each iteration a weight to ev- 
ery particle according to a Poisson sample of the expected 
mass given by the inverse of / sel evaluated at the distance 
of the corresponding galaxy. We consider for our analysis 
a set of 100 constrained 2LPT reconstructions of the LU, 
with the corresponding initial conditions and peculiar ve- 
locity fields. Our approach searches for Gaussian fields with 
a given prior power-spectrum (red curve in the right panel of 
Fig.[T]). In Bayesian terms, the posterior from which we sam- 
ple the initial density fluctuations is based on the above men- 
tioned prior and a Poissonian likelihood for the test particles 
which we assume to be unbiased matter tracers. However, as 
the data are s parse and nois y and have a moderate galaxy 
bias (see e. g. lLavauxl l2010T h the posterior resulting from 
weighting the likelihood with the prior yields closely unbi- 



ased power spectra. This is also indicating that our treat- 
ment of the selection function is accurate. Otherwise a few 
modes on the very large scales would show a clear excess of 
power as we found in our tests. We note, that further studies 
should be done including a proper treatment of galaxy bias. 
It is difficult to do a fair comparison between the density field 
and the galaxy field due to bias and selection function ef- 
fects. However, we find a remarkable correlation up to k ~ 1 
/iMpc -1 between the constrained 2LPT simulated overden- 
sity field (^ rec = N? art /N part - ! 5 with iV part : test particles 
number count per cell i, N part ; mean) and the one directly 
computed from the galaxies (5f al = Nf &1 / (N** 1 ff el ) - 1, 
with Nf al : galaxy number count per cell i, N sal ; expected 
mean) (see left panel) . The correlation does not vanish until 
k ~ 2.5 /iMpc -1 . This result is supported by the Super- 
galactic plots in Fig. [2] (left and middle panels), which show 
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40-50 h- 1 Mpc 




Figure 4. Mollview plots in Galactic coordinates using HEALPix of the number counts of spectroscopic 2MASS galaxies. On the left 
with n s ide — 64 in different comoving distance ranges. The middle panels (right panel on the bottom) show the corresponding logarithm 
of number counts of particles from the reconstructed density field (one sample belonging to the highly correlated subsample) using 
™side — 128. The right panels (not present on the bottom) show the corresponding radial velocities averaged on each pixel with the 



number counts of particles in km s using n ; 



side : 



128. We note that the Galactic longitudes ■ 



^al: 



have a shift of 360°. 



how the nonlinear structures are accurately traced along the 
distribution of galaxies at scales of 2-5 Mpc, even in re- 
gions with only of a few data points. The right panel shows 
the corresponding peculiar velocity field to a high level of 
detail demonstrating the formation of caustics in the high 
galaxy number density regions. We expect to get better 
results using an improved structure formation model, espe- 
cially in low density regions where 2LPT is known to per- 
form worse (see e.g. iBernardeau et al. I l2002h . We leave such 
a work for future studies. We have checked that systematic 
discrepancies between the reconstructed 2LPT density field 
and the galaxy field are rather small (we obtain a statisti- 
cal correlation coefficent of about 0.96 for the log-density 
field taking the volume of (160 h~ x Mpc) 3 ) when one allows 
for an effective, linear bias combined with a Poisson sam- 
pling of the field that accounts for both galaxy bias and 
the failure of 2LPT on small scales. A way of improving 
this woul d consist on using the halo model as it was sug- 
gested by (|Scoccimarro &; Sh eth 2002). We use the ensemble 
of constrained simulations to compute the mean and stan- 
dard deviation in each cell (see middle and right panel in 



Fig. [3}. This gives us an estimate of the uncertainty in the 
position of the density peaks. If we look at Coma which cen- 
tral region has an extension of a few h~ x Mpc and is located 
at (0,69,11) /i -1 Mpc in SG-coordinates, we find that the 
uncertainty in the position is only of about 2-3 h~ x Mpc. 
The major uncertainties concern the extension around very 
massive structures being overall very robust (with S/N>1 
in most of the cells, see right panel). The quality of the re- 
construction can be further assesse d in the Mollview p lots of 
Fig. H performed with HEALPix (|G6rski et all 120051 ). Here 
the data (left panels) can be compared to the reconstruction 
(middle and right panels) for different redshift slices. The ac- 
curacy of the reconstruction is very apparent to the level of 
connecting small number of galaxies through tiny filaments. 
On the right panels the radial peculiar veloci ty field shows 
simil ar patterns than in linear theory (see lErdogdu et al.l 
2006, for a careful description of the different structures), 
however reconstructing a significantly more complex struc- 
ture. The bottom panels in Fig. 2] show the full projection 
on the sky to a distance of 80 h~ x Mpc. The data (left panel) 
have a very noisy appearance while the reconstruction (right 
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panel) unveils the corresponding cosmic web0 Fig. [5] shows 
our calculations of the LG velocity by evaluating the recon- 
structed velocity field at the center of the box. The uncer- 
tainties in the estimation of the velocity field mainly depend 
on the randomness of the initial fluctuations and the shot 
noise due to the discrete galaxy sam ple including se l ection 
function effects. As it was shown in iKitaura et all (|201lh 
2LPT is extremely accurate in the determination of the pe- 
culiar velocity field. We therefore do not expect significant 
systematic errors beyond the fact that we are considering 
the matter inside a volume of (160 /i _1 Mpc) 3 . We repeat 
our calculations for a subsample of extremely highly corre- 
lated reconstructions with the galaxy field finding consistent 
results and hence demonstrating the robustness of our cal- 
culations. 




{ l LG =291±16 c 
l LG =293± 9° 



± b LG =34±8° 
b|_Q= 34± 4° 



If 
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Figure 5. Histograms for the speed -ulg an d the direction 
(Ilg, ^lg) (Galactic) of the Local Group for the 100 reconstructed 
samples (black lines) and the highly correlated subsample (see 
caption in Fig. [4j (dashed lines filled in blue color code). The 
corresponding means and 1 sigma deviations are indicated. 



3 DISCUSSION AND CONCLUSIONS 

We have performed a Bayesian cosmography analysis of 
the LU with the 2MRS galaxy redshift survey. Our ap- 
proach leads to more accurate estimates of the dynamics 
in the LU with respect to previous one s which either as- 
sume linear Eulerian relations (see e. g . iFisher et al.l ll995: 
lErdogdu et alJl200d IBilicki et al.ll201lh or linear LPT (see 
e. g. lLavaux et alT 2010l ). There are still a number of is- 
sues which should be further investigated, as for instance 
the impact of galaxy bias or full nonlinear evolution on the 
dynamics. To extend the study presented here to larger vol- 
umes one should include a treatment of the so-called rocket- 
effect as it becomes more relevant in regions where the se- 
lection effects are more severe, i. e. at lar ger distances to the 
observer (see e. g. iBranchini et al1l2012h . This effect could 
be iteratively corrected by re-computing the selection func- 
tion at the galaxy estimated real-space position at each re- 
constru ction step in our a lgorithm following the pioneering 
work of lYahil et all (|l99lh . Nevertheless, it is worth noting 
that we model for the first time nonlocal and nonlinear ef- 
fects in the calculation of the local cosmic flow including 
the tidal field component (within 2LPT) . To perform higher 
order LPT it is essential to get accurate estimates of the 
linea r component of the den s ity field as it was po inted out 
in IKitaura fe Angulol (|201lh : IKitaura et alJ (|201lh . This is 
done in this work in a self- consistent way by iteratively sam- 
pling the Gaussian fields compatible with the data within 
2LPT. In particular, we map local structures like the Lo- 
cal Void, the Local Supercluster (Virgo), the Coma Clus- 
ter, the Perseus Pisces Supercluster and the Great Attractor 
(Hydra and Centaurus) in great detail. Our results show a 
wide range of LG velocity amplitudes (from about 350 to 
600 kms -1 ). These findings indicate that the propagation 
of uncertainties is non-trivial and hence, the need for more 
studies including a nonlocal and nonlinear self- consistent 
treatment of gravity on different volumes (as indicated in 
lErdogdu fe Lahavl hoO^. The lower speeds than the ones 
from the CMB (16-18% lower) found in our study and the 
angle separation of about 20° ±10° indicate that the dipole 
has not converged yet considering the matter up to distances 



1 The figures are available in high resolution at: 
www . aip . de/People/ f kitaura/ 



of about 80 h- 1 Mpc. This is in good agreement with ACDM 
which predicts the dipole to have reached a t those scales 
abou t 70 to 80% of its tot al amplitude (see e. g. lLavaux et al.l 
bold IBilicki et al.ll201lh . Interestingly, the amplitude and 
direction of the LG velocity we find is compatible within 
the (1 sigma) error bars with the direct observation of pe- 
culiar motions (see ICourtois et all [2012). Our results are 
also consistent (within 2 s igma) with previou s studies (see 
e^_g. lErdogdu et al] 12003 lLavaux et aD l2010l : IBilicki et al.l 
l201lh . We hope that the outcome of works like the one pre- 
sented here leads to a variety of applications ranging from 
cosmic web analysis, environmental studies and signal de- 
tections like the kinematic Sunyaev-Zel'dovich effect, to ac- 
curate constrained simulations enabling us to improve our 
understanding on structure formation in the LU. 
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